Automatic Generation of Natural Language Parsers from Declarative Specifications
Authors
Abstract
The parsing schemata formalism allows us to describe parsing algorithms in a simple way by capturing their fundamental semantics while abstracting low-level detail. In this work, we present a compilation technique allowing the automatic transformation of parsing schemata to executable implementations of their corresponding algorithms. Taking a simple description of a schema as input, our technique generates Java code for the corresponding parsing algorithm, including schema-specific indexing code in order to attain efficiency. Our technique is general enough to be able to handle all kinds of schemata for context-free grammars and other grammatical formalisms, providing an extensibility mechanism which allows the user to define custom notational elements.
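To make the idea of an executable schema concrete, the following is a hand-written sketch, not output of the system described above. It assumes the CYK schema for grammars in Chomsky normal form, with items of the form [A, i, j], and runs it with an agenda-driven deductive engine. The two position maps stand in for the kind of schema-specific indexing the abstract mentions. All names (`CykSchemaSketch`, `Item`, `Rule`, `recognize`, `combine`) and the toy grammar are hypothetical, and the code needs Java 16 or later for records.

```java
import java.util.*;

public class CykSchemaSketch {
    // Item [symbol, start, end]: `symbol` derives the words start..end-1.
    public record Item(String symbol, int start, int end) {}

    // Binary rule head -> left right of a grammar in Chomsky normal form.
    public record Rule(String head, String left, String right) {}

    public static boolean recognize(List<Rule> binary,
                                    Map<String, List<String>> lexicon,
                                    String goal, List<String> words) {
        Set<Item> chart = new HashSet<>();
        Deque<Item> agenda = new ArrayDeque<>();
        // Indexes (an assumption of this sketch): chart items grouped by
        // start and by end position, so adjacent partners are found directly.
        Map<Integer, List<Item>> byStart = new HashMap<>();
        Map<Integer, List<Item>> byEnd = new HashMap<>();

        // Initial step: if A -> w_i is a lexical rule, derive [A, i, i+1].
        for (int i = 0; i < words.size(); i++) {
            for (String a : lexicon.getOrDefault(words.get(i), List.of())) {
                agenda.add(new Item(a, i, i + 1));
            }
        }

        while (!agenda.isEmpty()) {
            Item item = agenda.poll();
            if (!chart.add(item)) continue; // already derived earlier
            byStart.computeIfAbsent(item.start(), k -> new ArrayList<>()).add(item);
            byEnd.computeIfAbsent(item.end(), k -> new ArrayList<>()).add(item);

            // Binary step with `item` as the left antecedent.
            for (Item right : byStart.getOrDefault(item.end(), List.of())) {
                combine(binary, item, right, agenda);
            }
            // Binary step with `item` as the right antecedent.
            for (Item left : byEnd.getOrDefault(item.start(), List.of())) {
                combine(binary, left, item, agenda);
            }
        }
        // Goal item: the start symbol spanning the whole input.
        return chart.contains(new Item(goal, 0, words.size()));
    }

    // From [B, i, k], [C, k, j] and a rule A -> B C, derive [A, i, j].
    static void combine(List<Rule> binary, Item left, Item right, Deque<Item> agenda) {
        for (Rule r : binary) {
            if (r.left().equals(left.symbol()) && r.right().equals(right.symbol())) {
                agenda.add(new Item(r.head(), left.start(), right.end()));
            }
        }
    }

    public static void main(String[] args) {
        List<Rule> grammar = List.of(
            new Rule("S", "NP", "VP"),
            new Rule("VP", "V", "NP"));
        Map<String, List<String>> lexicon = Map.of(
            "she", List.of("NP"),
            "eats", List.of("V"),
            "fish", List.of("NP"));
        System.out.println(recognize(grammar, lexicon, "S", List.of("she", "eats", "fish")));
        System.out.println(recognize(grammar, lexicon, "S", List.of("eats", "she", "fish")));
    }
}
```

In this sketch the schema is fixed in the code. A compiler of the kind the abstract describes would instead take the item form and deduction steps as input and emit the corresponding engine and indexes.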
Similar resources
Tools for Generation of Natural Inflected Language Processors
Supporting multiple languages and natural language processing are of high importance in information systems. This paper discusses software tools for the generation of language processors (LPs) for natural inflected languages. The tools are implemented in the LP generator DUAL, which allows for formal specification and reusability of developed components. The declarative language Dual is us...
A Structured Interactive Workspace for a Visual Configuration Language
This paper shows how language technologies such as the automatic generation of parsers for analyzing user actions and visual parsing can be applied to build a flexible tool specialized in complex specification tasks, namely the configuration of distributed applications. The central issue is to propose to structure the workspace through a syntax of user actions on one hand, and a syntax of visua...
Trading off Completeness for Efficiency - The ParseTalk Performance Grammar Approach to Real-World Text Parsing
We argue for a performance-based design of natural language grammars and their associated parsers in order to meet the constraints posed by real-world natural language understanding. This approach incorporates declarative and procedural knowledge about language and language use within an object-oriented specification framework. We discuss several message passing protocols for real-world text pa...
Looking for Errors: A Declarative Formalism for Resource-adaptive Language Checking
The paper describes a phenomenon-based approach to grammar checking, which draws on the integration of different shallow NLP technologies, including morphological and POS taggers, as well as probabilistic and rule-based partial parsers. We present a declarative specification formalism for grammar checking and controlled language applications which greatly facilitates the development of checking...
The ucsCNL: A Controlled Natural Language for Use Case Specifications
In general, test generation tools receive as input either requirements or use case specifications. However, in most companies these specifications are written in free natural language (NL), and the lack of standardization may become a problem for the generation tools and testers. A promising solution is to use a Controlled NL (CNL) to write software specifications. We present here the ucsCNL fo...